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DETAILED ACTION 

Response to Amendment 

This Office action has been issued in response to amendment filed August 
3 rd , 2006. Claims 1-20 are pending . Applicant's arguments have been carefully 
and respectfully considered, but they are not entirely persuasive, as will be 
discussed in more detail below, even in light of the instant amendments. 
Accordingly, this action has been made FINAL. 



Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in 
public use or on sale in this country, more than one year prior to the date of application for patent in 
the United States. 

Claims 1-8, 10, and 12-20 are rejected under 35 U.S.C. 102(b) as being 
anticipated by Wilson (U.S. Patent no. 5,557,734, hereafter referred to as 
Wilson). 

Independent Claims 

With respect to independent claims 1 (and 12), 

"A distributed data cache [disclosed, e.g. in FIG. 5, elements MEMO- 
MEMS] coupled to one of a plurality of heterogeneous processing nodes, 
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wherein the heterogeneous processing nodes [disclosed in FIG. 5 
Processors 46, 47, and 481 are coupled by a programmable 
interconnection network fdisclosed, e.g. in column 1, lines 10 and 111 
having a network root [FIG. 1, element 21, "Host"] and a plurality of 
crosspoint switches, FIG. 5, elements 50, 511 the distributed data cache 
as m e mory or r e gister fil e comprising: a first cache memory unit [e.g. FIG. 
5, element MEM8] having a plurality of cache ports; and a plurality of data 
buses connected with the cache memory unit, wherein each of the 
plurality of data buses is connected with the plurality of cache ports of the 
cache memory unit" [As noted in FIG. 5, the cache memory MEM8 
discloses two separate ports (plurality of cache ports) which are 
connected to two different data buses] 

With respect to independent claim 12, 

"An apparatus for transposing transferring a plurality of data values 
arranged in a matrix, [Wilson teaches this limitation, e.g. in column 31, 
lines 20-24] the apparatus coupled to one of a plurality of heterogeneous 
processing nodes, [disclosed in FIG. 5 Processors 46, 47, and 48] 
wherein the heterogeneous processing nodes are coupled by a 
programmable interconnection network [disclosed, e.g. in column 1, 
lines 10 and 11] having a network root [FIG. 1, element 21, "Host"] and 
a plurality of crosspoint switches, [FIG. 5, elements 50, 51]_f/?e apparatus 
comprising: a plurality of cache memory units, [disclosed, e.g. in FIG. 5, 
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elements MEM0-MEM8] each cache memory unit having a plurality of 
cache ports; and a plurality of data buses, each data bus connected with a 
different one of the plurality of cache ports from each of the cache memory 
or register file units." [As noted in FIG. 5, the cache memory MEM8 
discloses two separate ports (plurality of cache ports) which are 
connected to two different data buses] 



With respect to independent claim 20, 

"A method for transposing transferring a plurality of data values arranged 
in a matrix, [Wilson teaches this limitation, e.g. in column 31, lines 20- 
24] the matrix including a plurality of heterogeneous processing nodes 
[disclosed in FIG. 5 Processors 46, 47, and 48] coupled by a 
programmable interconnection network [disclosed, e.g. in column 1, 
lines 10 and 11] having a network root [FIG. 1, element 21, "Host"] and 
a plurality of crosspoint switches, [FIG. 5, elements 50, 51] the method 
comprising: retrieving a first subset of data values from the plurality of data 
values from a memory unit; [Wilson discloses in column 15, lines 18- 
20, "...the four cache memories MEM5-MEM8 are capable of storing 
arbitrary segments of matrix data retrieved from external memory 
23." Wilson teaches retrieving a first subset of data values from the 
plurality of data values (arbitrary segments of matrix data) from a 
memory unit (external memory).] simultaneously transferring the first 
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subset of data values to a plurality of data buses, wherein each data value 
of the first subset is transferred to a different one of the plurality of data 
buses; [Wilson next discloses in the abstract "The system further 
includes a controller for each chip that sequences a burst of 
consecutive rows of a data matrix from the external cache burst 
memory, to be stored in either the cache memory associated with 
each of the processing units or routed directly to the processors 
included in each processing unit." In this disclosure, Wilson teaches 
the simultaneous transfer of the first subset of data values (burst of 
consecutive rows of a data matrix) to a plurality of data buses 
(routed directly to the processors included in each processing unit). 
Since there is a different data bus connected to the plurality of 
processing units (see FIG. 1) the plurality of data values are each 
transferred to a different one of the plurality of buses (that lead to the 
different processing units).] and simultaneously loading the first subset 
of data values from the plurality of data buses to a first cache memory unit 
having a plurality of cache ports lines, wherein each cache port receives a 
data value from a different one of the plurality of data buses without any 
lat e ncy " [Wilson also teaches simultaneously loading the first subset 
of data values from the plurality of data buses to a first cache 
memory unit (...to be stored in either the cache memory associated 
with each of the processing units). The examiner interprets loading 
to mean stored. It is inherent for a cache memory unit to have a 
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plurality of cache lines when there are pluralities of data values 
stored within the cache. FIG. 5 shows each cache port (see cache 
memory units MEMO-8) receiving data from a different one of the 
plurality of data buses (shown in Transpose In Bus connecting to the 
cache memory units through S10-17).] 

Dependent Claims 

With respect to claim 2, 

"The distributed data cache of Claim 1, further comprising a data path 
adapted for processing data and having at least one data input and at 
least one data output, wherein the data input and data output are 
connected with the plurality of data buses" is disclosed in column 8, lines 
11-18. 

The examiner interprets claim 2 as a data path, which can process 
data (or move data from and to), and has at least one data input and one 
data output that are connected to the plurality of data buses. 

Wilson discloses in column 8, lines 15-18, FIG. 1 elements 5, 21, 
22 "The host 21 can independently read or write data, such as image data 
obtained from an external source (not shown), or write instructions in 
memories 23 through bus 22 for use by chip 5." Wilson teaches that a 
data path (read/write data from memory through bus) has a data input 
(host) and a data output (memory). Also, since the memories are 
connected to the Processing Chips the data bus is also connected to the 
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plurality of buses (transpose in and transpose out buses). Furthermore, it 
is also inherent that a data path (or data bus) has at least one data input 
and at least one data output in order for signal or data to travel from one 
element to another. Therefore the data input and the data output are 
inherently connected with the plurality of data buses because they are a 
component of a data bus. 

With respect to claim 3, 

"The distributed data cache of Claim 2, further comprising a multiplexer for 
alternately connecting the data input with each of the plurality of data 
buses" is disclosed in column 14, lines 64-65. 

The examiner interprets this claim to mean that the distributed data 
cache of claim 1 comprises a multiplexer for connecting data input with 
each of the plurality of buses. 

Wilson discloses in column 14, lines 64-65, FIG. 7, elements 71-72, 
"The data line 74, which is the output from multiplexer 73, and data line 
59, provided as inputs to multiplexer 75..." FIGs. 5 and 7 disclose how the 
data is connected to the plurality of buses 

With respect to claim 4, 

"The distributed data cache of Claim 2, further comprising a multiplexer for 
alternately connecting the data output with each of the plurality of data 
buses" is disclosed in column 14, lines 64-65. 
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The examiner interprets this claim to mean that the distributed data 
cache of claim 1 comprises a multiplexer for connecting data output with 
each of the plurality of buses. 

Wilson discloses in column 14, lines 64-65, FIG. 7, element 75, 
"The data line 74, which is the output from multiplexer 73, and data line 
59, provided as inputs to multiplexer 75..." FIGs. 5 and 7 disclose how the 
data is connected to the plurality of buses. 

With respect to claim 5, 

"The distributed data cache of Claim 1, further comprising a plurality of 
data address generators connected with a memory unit and the plurality of 
data buses without latency" is disclosed in column 24, lines 39-46. 

The examiner interprets this claim to mean the distributed data 
cache of Claim 1, which further comprises a plurality of data address 
generators connected with a memory unit and the plurality of data buses. 
The examiner interprets this claim sans the limitation "without latency" for 
reasons stated above. 

Wilson discloses "The word on lines CTL12 correspond to a starting 
address. The output on line 103 is sent to the address inputs of all cache 
memories MEMS-MEMS. Control lines CTL13 provides a word, which is 
constant for this data cycle, that is added to the output of counter 121 by 
adder 126, which then outputs an address for memories MEM0-MEM4" 
(column 24, lines 39-46). 
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With respect to claims 6 and 7, 

"The distributed data cache of Claim 5, wherein the plurality of data 
address generators are adapted to retrieve a plurality of data values from 
the memory unit and communicate the plurality of data values to the 
plurality of data buses directly without any latency due to registering" and 
"The distributed data cache of Claim 6, wherein the plurality of data 
address generators are adapted to simultaneously communicate the 
plurality of data values to the plurality of data buses, wherein each of the 
plurality of data values is communicated to a different one of the plurality 
of data buses," respectively are both disclosed in column 26, lines 26-33, 
FIG. 13 element 24C. 

The examiner interprets claim 6 to mean the distributed data cache 
of claim 5, which further comprises a plurality of data address generators 
that are able to retrieve (or access, communicate, etc) a plurality of data 
values from the memory unit and communicate these plurality of data 
values to the plurality of data buses. The examiner interprets this claim 
sans the limitation "without any latency due to registering" for reasons 
stated supra. 

The examiner interprets claim 7 to mean the distributed data cache 
of claim 6, wherein the plurality of address generators have the ability to 
simultaneously (or within the same clock cycle) communicate the plurality 
of data values to the plurality of data buses, wherein each of the plurality 



Application/Control Number: 10/628,036 Page 
Art Unit: 2189 

of data values is communicated to a different one of the plurality of data 

buses. 

Wilson discloses "The data is burst in sequential row order, or, in 
other words, a row at a time, wherein each clock pulse to external memory 
23 causes the next consecutive address (e.g., the next row of data in 
memory 23) to be referenced, the data stored therein being sent via lines 
24C to, for example, the cache memories MEM5-MEMS, or to processors 
46, 47, and 48 for processing during that clock cycle" is disclosed in 
column 26, lines 26-33, FIG. 13 element 24C. Wilson references 
(communicates or retrieves) a row of data (plurality of data values) to a 
plurality of data buses (lines 24C of FIG. 13) simultaneously (burst and 
during the same clock cycle). 

With respect to claim 8, 

"The distributed data cache of Claim 7, wherein the first cache memory 
unit is adapted to simultaneously load a plurality of data values from the 
plurality of data buses, such that each of the plurality of data values is 
loaded in a different one of the plurality of cache lines of the first cache 
memory unit through the same port" is disclosed in column 2, lines 44-48 
and the abstract 

The examiner interprets this claim to mean the cache memory units 
(MEMO-8 elements of FIG. 5) can be made to simultaneously load (burst) 
one of the sets of data values (one of a consecutive row of a data matrix) 
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from the plurality of data buses (Note: both Transpose In and Transpose 
Out Buses shown in FIG. 5 of Wilson are each a plurality of buses). 
Additionally, the apparatus can be made to load a data value to a different 
one (of a plurality of) cache lines of the first cache memory unit through 
the same port. 

Wilson discloses "The system further includes a controller for each 
chip that sequences a burst of consecutive rows of a data matrix from the 
external cache burst memory, to be stored in either the cache memory 
associated with each of the processing units or routed directly to the 
processors included in each processing unit" (abstract). Wilson also 
discloses in column 2, lines 44-48, "Usage of cache memory internal to 
the microprocessor allows instructions and data to be stored in different 
caches and be simultaneously addressed thereby improving the memory 
bandpass internally." Since data is stored in different caches, it is 
transferred through a plurality of buses. Also, FIG. 1 shows since different 
caches are addressed simultaneously, there are different cache lines 
through which data values are loaded in (stored). 

With respect to claim 10, 

"The distributed data cache of Claim 1, further comprising at least one 
additional cache memory unit also having a plurality of cache lines, 
wherein each cache line of the additional cache memory unit is connected 
with the plurality of data buses," is disclosed in column 20, lines 54-59. 
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The examiner interprets this claim to mean (the distributed data 
cache of Claim 1) which comprises at least one additional cache memory 
unit that also has a plurality of cache lines, wherein each cache line of the 
additional cache memory unit is connected with the plurality of buses. It is 
inherent that a cache memory unit has a plurality of cache lines. 

Wilson discloses in column 20, lines 54-59, FIG. 5 "As shown in 
FIG. 5, each of the eight transpose-bus-out lines 55 are connected via a 
respective line 100 to a respective output selector S56 in each of the 
processing units 29A-29H. Control line CTL3 in only a selected one of the 
eight processing units 29A-29H can be activated to cause the contents of 
the respective cache memories MEM0-MEM7 to be transferred to 
transpose-bus-out 55 via selectors S10-S17." Wilson teaches in FIG. 5 
that each of the cache memory units' (at least one additional cache 
memory unit: see elements MEMO-8) cache lines (inherent) is connected 
with the plurality of data buses (transpose out/in buses through element 
S10-17and S20-28). 

(Note: The following is in regard to claims 13, 15, and 16) 

With respect to claim 13, 

"The apparatus of Claim 14, wherein the plurality of data address 
generators are adapted to sequentially communicate the plurality of sets 
of data values with the plurality of data buses without any register latency," 

claim 15, 
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"The apparatus of Claim 14, wherein the plurality of data address 
generators are adapted to sequentially communicate the plurality of sets 
of data values with the plurality of data buses without any register latency," 
claim 16, 

"The apparatus of Claim 15, wherein the plurality of data address 
generators are adapted to simultaneously communicate the data values of 
each set of data values to the plurality of data buses without any register 
latency" are all disclosed in column 26, lines 21-34, FIG. 13 element 24C. 



The examiner interprets claim 13 to mean (the apparatus of claim 
12), which further comprises a plurality of data address generators that 
are able to retrieve (or access, communicate, etc) a plurality of data 
values from the memory unit and communicate these plurality of data 
values to the plurality of data buses. The examiner interprets the claim 
limitation "without any latency" as the amount of time it takes Wilson to 
move the data. 

The examiner interprets claim 15 to mean the plurality of address 
generators have the ability to sequentially communicate the plurality of 
sets of data values with the plurality of data buses. The examiner 
interprets the claim limitation "without any register latency" for reasons 
stated supra. 

The examiner interprets claim 16 to mean the plurality of address 
generators have the ability to simultaneously (or within the same clock 
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cycle) communicate the plurality of sets of data values to the plurality of 
data buses. 

Wilson discloses "The data is burst in sequential row order, or, in other 
words, a row at a time, wherein each clock pulse to external memory 23 
causes the next consecutive address (e.g., the next row of data in memory 
23) to be referenced, the data stored therein being sent via lines 24C to, 
for example, the cache memories MEM5-MEMS, or to processors 46, 47, 
and 48 for processing during that clock cycle" is disclosed in column 26, 
lines 26-33, FIG. 13 element 24C. The plurality of data address 
generators are referenced again in column 26, lines 21-25. Wilson also 
discloses, "the contents of the transpose-bus-out 55 will be transferred to 
lines 24C that are connected to external memory 23" (column 20 lines 66- 
67). Lines 24C are used as a plurality of buses each leading to a different 
memory port. 

As per claim 15, Wilson teaches that the data is burst in sequential 
row order, or, in other words, a row at a time (sequentially and in "data 
sets" or rows), and later sent via lines 24C (communicated to a plurality of 
buses). 

As per claim 16, Wilson discloses a row of data (plurality of sets of 
data values) is communicated (referenced or sent) to a plurality of data 
buses (lines 24C of FIG. 13) simultaneously (burst and during the same 
clock cycle). 
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With respect to claim 14, 

"The apparatus of Claim 13, wherein the plurality pf data values comprises 
a plurality of sets of data values" is disclosed in column 5, lines 64-66 and 
column 8 lines 51-53. 

The examiner interprets this claim to mean a plurality of data values 
comprises a plurality of sets or groups of data values. 

Wilson outlines a "respective row of N data bits" in column 5, lines 
64-66. Wilson shows that data is grouped over a plurality of rows (set of 
data values). Wilson also discloses "one or more columns of array data 
stored in memory" in column 8, lines 51-53, which illustrate a set of data 
values (one or more columns of array data). 

With respect to claim 17, 

"The apparatus of Claim 16, wherein each set of data values is a matrix 
row" is disclosed in column 4, lines 64-68. 

The examiner interprets this claim to mean that each set of data 
values is a matrix row. 

Wilson discloses in column 4, lines 64-68, "The apparatus 
comprises: a first memory means having L rows and M columns, for 
storing a data matrix having P columns, where P is an integer greater than 
M, wherein the data matrix is stored in the first memory means in a 
plurality of segments." Wilson outlines a "respective row of N data bits" in 
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column 5, lines 64-66. Wilson shows that data is grouped over a plurality 
of rows (set of data values). 



With respect to claim 18, 

"The apparatus of Claim 16, wherein each set of data values is a matrix 
column" is disclosed in column 4, lines 64-68. 

The examiner interprets this claim to mean that each set of data 
values is a matrix column. 

Wilson discloses in column 4, lines 64-68, "The apparatus 
comprises: a first memory means having L rows and M columns, for 
storing a data matrix having P columns, where P is an integer greater than 
M, wherein the data matrix is stored in the first memory means in a 
plurality of segments." Wilson discloses "one or more columns of array 
data stored in memory" in column 8, lines 51-53, which illustrate a set of 
data values (one or more columns of array data). 

With respect to claim 19, 

"The apparatus of Claim 16, wherein each of the cache memory units is 
adapted to simultaneously load one of the sets of data values from the 
plurality of data buses, such that each data value of the set of data values 
is loaded in a different one of the plurality of cache ports of the cache 
memory unit or register file" is disclosed column 2, lines 44-48 and the 
abstract. 
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The examiner interprets this claim to mean the cache memory units 
(MEMO-8 elements of FIG. 5) can be made to simultaneously load (burst) 
one of the sets of data values (one of a consecutive row of a data matrix) 
from the plurality of data buses (Note: both Transpose In and Transpose 
Out Buses shown in FIG. 5 of Wilson are each a plurality of buses). 
Additionally, the apparatus can be made to load a data value from a set of 
data values to a different one of the plurality of cache memory units. As 
stated supra, the examiner interprets cache port to simply identify the 
means through which a data bus connects to a cache memory unit. 

Wilson discloses "The system further includes a controller for each 
chip that sequences a burst of consecutive rows of a data matrix from the 
external cache burst memory, to be stored in either the cache memory 
associated with each of the processing units or routed directly to the 
processors included in each processing unit" (abstract). Wilson also 
discloses in column 2, lines 44-48, "Usage of cache memory internal to 
the microprocessor allows instructions and data to be stored in different 
caches and be simultaneously addressed thereby improving the memory 
bandpass internally." Since data is stored in different caches, it is 
transferred through a plurality of buses. Also, since different caches are 
addressed simultaneously, there are different cache ports through which 
data values are loaded in (stored). 
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Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for 
all obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described 
as set forth in section 102 of this title, if the differences between the subject matter sought to 
be patented and the prior art are such that the subject matter as a whole would have been 
obvious at the time the invention was made to a person having ordinary skill in the art to which 
said subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 



The factual inquiries set forth in Graham v. John Deere CO., 383 U.S. 1, 
148 USPQ 459 (1966), that are applied for establishing a background for 
determining obviousness under 35 U.S.C. 103(a) are summarized as follows: 

1 . Determining the scope and contents of the prior art. 

2. Ascertaining the differences between the prior art and the claims at 
issue. 

3. Resolving the level of ordinary skill in the pertinent art. 

4. Considering objective evidence present in the application indicating 
obviousness or nonobviousness. 



Claim 9 is rejected under 35 U.S.C. 103(a) as being obvious over Wilson 
(U.S. Patent no. 5,557,734) in view of Okuda et. al (US Patent No. 6,535,452 
hereafter referred to as Okuda). 

With respect to claim 9, Wilson teaches "The distributed data cache of 
Claim 1 ..." as stated supra. 

Wilson, however, does not disclose expressly "...wherein the number of 
cache lines of the first cache memory unit are equal to the number of data 
buses." 
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Okuda discloses, "...respective blocks of the memory blocks 
corresponding to the m respective data pins, and are simultaneously activated, 
the activation of any one of the address selection lines connecting the data bus 
lines to a corresponding one of the m respective blocks and resulting in the n 
data pieces being input/output to/from the corresponding one of the m respective 
blocks" (Okuda abstract). The examiner interprets cache lines as memory blocks 
corresponding to respective data pins. 

Wilson and Okuda are analogous art because they are from the same field 
of endeavor, that being cache or memory data distribution. 

At the time of the invention it would have been obvious to a person of 
ordinary skill in the art to incorporate a distributed data cache of claim 1 in which 
the number of cache lines or memory blocks of the cache memory unit is equal to 
the number of data buses outlined above to arrive at claim 9. 

The motivation for doing so would have been obvious based on the 
enhancement of speed of memory access, which is disclosed in columnl, lines 
32-42, which recites, "For the purpose of speeding up the operation speed of 
semiconductor memory devices, generally, the operation of core circuits inside 
the memory devices need to be made faster. It is difficult, however, to speed up 
the operation of core circuits because of limitations such as wire delays. When a 
fixed number of data bits are to be serially input/output upon a single access, 
provision may be made not only to read data corresponding to the plurality of DQ 
pins in parallel from the memory core, but also to read serially output data in 
parallel from the memory core." 
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Therefore, it would have been obvious to one of ordinary skill in the art at 
the time of the invention having the teachings of Wilson and Okuda before 
him/her to combine Okuda with Wilson for the benefit of having the number of 
cache lines equal to the number of data buses to obtain the invention as 

specified in claim 9. 

Claim 11 is rejected under 35 U.S.C. 103(a) as being obvious over Wilson 
(U.S. Patent no. 5,557,734) with "Computer Organization and Design" 2 nd edition, 
by Hennessy and Patterson offered as extrinsic evidence (hereafter referred to* 
as Hennessy). 

With respect to claim 11, Wilson teaches "The distributed data cache of 
Claim 10..." as stated supra. 

Wilson, however, does not disclose expressly "...wherein the total number 
of cache memory units is equal to the number of cache lines in each cache 

memory unit." 

Hennessy discloses, "In a set-associative cache, there are a fixed number 
of locations (at least two) where each block can be placed; a set-associative 
cache with n locations for a block is called an n-way set-associative cache. An n- 
way set-associative cache consists of a number of sets, each of which consists 
of n blocks... "(page 569). Blocks in this definition is analogous to cache line. 

Wilson and Hennessy are analogous art because they are from the same 
field of endeavor, that being ordering or controlling memory commands. 
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At the time of the invention it would have been obvious to a person of 
ordinary skill in the art to design a distributed data cache which has an N-way, M- 
line set associative cache design implementation wherein N is equal to M 
outlined above to arrive at claim 11. Each cache way is treated like a small direct 
mapped cache. The examiner interprets figures 6 and 7 to teach an 
implementation of a direct-mapped cache. 

The motivation for doing so would have been obvious based on 
Hennessy, page 570, "The advantage of increasing the degree of associativity is 
that it usually decreases the miss rate..." 

Therefore, it would have been obvious to one of ordinary skill in the art at 
the time of the invention having the teachings of Wilson and Hennessy before 
him/her to combine Hennessy with Wilson for the benefit of having a M-way, M- 
set associative distributed data cache to obtain the invention as specified in claim 
11. 

ARGUMENTS CONCERNING PRIOR ART REJECTIONS 
1 ST POINT OF ARGUMENT : 

With respect to the arguments on page 7 of the applicant's 
remarks, the examiner believes that in the clarified rejection supra with 
respect to independent claims 1, 12, and 20 each show "a plurality of 
heterogeneous processing nodes... coupled by a programmable 
interconnection network having a network root and a plurality of crosspoint 
switches." 
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Furthermore, the examiner notes that Wilson does in fact teach that 
crosspoint switch 50 of Figure 5 is coupled to the processing units via the 
data bus lines. The examiner interprets a network root as a host computer 
system as seen connected to a network of processing units or nodes as 
shown in Figure 1 of Wilson. Although the examiner believes that Wilson 
does in fact teach the recited architecture of the applicant's claim, the 
examiner would also like to point the applicant to MPEP 2131.05 which 
states, "A reference is no less anticipatory if, after disclosing the invention, 
the reference then disparages it. The question whether a reference 
"teaches away" from the invention is inapplicable to an anticipation 
analysis." 

CONCLUSION 

THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of 
time policy as set forth in 37 CFR 1 . 1 36(a). 

A shortened statutory period for reply to this final action is set to expire 
THREE MONTHS from the mailing date of this action. In the event a first reply is 
filed within TWO MONTHS of the mailing date of this final action and the advisory 
action is not mailed until after the end of the THREE-MONTH shortened statutory 
period, then the shortened statutory period will expire on the date the advisory 
action is mailed, and any extension fee pursuant to 37 CFR 1.136(a) will be 
calculated from the mailing date of the advisory action. In no event, however, will 
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the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 

Direction of Future Correspondences 

Any inquiry concerning this communication or earlier communication from 
the examiner should be directed to Horace L Flournoy whose telephone number 
is (571) 272-2705. The examiner can normally be reached on Monday through 
Friday 8:00 AM to 5:30 PM (ET). 

Important Note 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Reginald G. Bragdon can be reached on (571) 272-4204. 
The fax phone numbers for the organization where this application or proceeding 
is assigned is (703) 746-7239. 

Information regarding the status of an Application may be obtained from 
the Patent Application Information Retrieval (PAIR) system. Status information 
for published applications may be obtained from either Private PAIR or PUBLIC 
PAIR. Status information for unpublished applications is available through Private 
Pair only. For more information about the PAIR system, see http://pair- 
direct.uspto.gov . Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll- 
free). 
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Any inquiry of a general nature or relating to the status of this application 
or proceeding should be directed to the receptionist whose telephone number is 
(571)272-2100. 

Horace L. Flournoy Reginald G. Bragdon 

Patent Ex^riner Supervisory Patent Examiner 

Art unit: 2189 Technology Center 2100 



